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Abstract — The high-signal-to-noise ratio (SNR) asymptotic be- 
havior of the mutual information (MI) for discrete constellations 
over the scalar additive white Gaussian noise channel is studied. 
Exact asymptotic expressions for the MI for arbitrary one- 
dimensional constellations and input distributions are presented 
in the limit as the SNR tends to infinity. Using the relationship 
between the MI and the minimum mean-square error (MMSE), 
asymptotics of the MMSE are also developed. It is shown 
that for any input distribution, the MI, MMSE and symbol- 
error probability have an asymptotic behavior proportional to 
a Gaussian Q-function whose argument depends only on the 
minimum Euclidean distance of the constellation. The developed 
expressions are then used to study the high-SNR behavior of the 
generalized mutual information (GMI) for bit-interleaved coded 
modulation (BICM). The long-standing conjecture that, at high 
SNR, Gray codes are the binary labelings that maximize the 
BICM-GMI, is proven. It is also shown that for any equally 
spaced A/-ary constellation, there always exists an anti-Gray code 
that gives the lowest BICM-GMI at high SNR. 

Index Terms — Anti-Gray code, additive white Gaussian noise 
channel, bit-interleaved coded modulation, discrete constellations. 
Gray code, minimum-mean square error, mutual information, 
high-SNR asymptotics. 

I. Introduction 

In this paper we consider the additive white Gaussian noise 
(AWGN) channel 
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where X is the transmitted symbol and Z is a Gaussian 
random variable, independent of X, with zero mean and unit 
variance. The capacity of the AWGN channel in (HJ is given 
by Q 

^ (2) 



C(p) = -log(l + 7) 

where 7 is the signal-to-noise ratio (SNR). Although inputs 
distributed according to the Gaussian distribution attain the 
capacity, they suffer from several drawbacks which prevent 
them from being used in practical systems. Among them, 
especially relevant are the unbounded support and the infinite 
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number of bits needed to represent signal points. In practical 
systems, discrete distributions with a bounded support are 
typically preferred. 

The mutual information (MI) where the input distribution 
is constrained to be a probability mass function (PMF) over 
a discrete constellation represents the maximum rate at which 
information can be reliably transmitted using that particular 
constellation. The MI for the AWGN channel as a function 
of the SNR for an arbitrary input PMF involves mixtures 
of discrete and continuous distributions, and thus, is difficult 
to analyze. To overcome this problem, this MI is typically 
assessed asymptotically at low or high SNR. 

While the low-SNR asymptotics of the MI for discrete 
constellations are well-understood (see |IT]-||4| and references 
therein), to the best of our knowledge, only upper and lower 
bounds are known for the high-SNR behavior For example, 
upper and lower bounds on the MI and/or the minimum mean- 
square error (MMSE) in the high-SNR regime were derived 
in Is)-!?!. It was argued in ||6l p. 1073] that for discrete 
constellations maximizing the MI is equivalent to minimizing 
both the symbol-error probability (SEP) and the MMSE. In 
|[8] Appendix E] two constellations with different minimum 
Euclidean distances (MEDs) are compared, and it is shown 
that, for sufficiently large SNR, the constellation with larger 
MED gives a higher MI. Upper and lower bounds on the MI 
and MMSE for multiple-antenna systems over fading channels 
can be found in ll9l-llT2l. 

In this paper, we study high-SNR asymptotics of the MI 
for discrete constellations. In particular, we consider arbitrary 
constellations and input distributions for the channel in ([T]), 
and find exact asymptotic expressions for the MI in the 
limit as the SNR tends to infinity. Using the MI-MMSE 
relationship established in |[T3l (see also lfT4l Ch. 2]), exact 
asymptotic expressions for the MMSE are also developed. We 
prove that for any constellation and input distribution, the MI, 
MMSE, and SEP have an asymptotic behavior proportional to 
Q (yp(i/2), where Q(-) is the Gaussian Q-function and d is 
the MED of the constellation. In terms of the proportionality 
constant, and for a uniform input distribution, these results 
show that the constellation that maximizes the MI is the same 
that minimizes both the MMSE and the SEP. 

While the results in this paper are general, we use them 
to study bit-interleaved coded modulation (BICM) ifTSjl-lfTTl. 
which can be viewed as a pragmatic approach for coded 
modulation. The key element in BICM is the use of a 
(suboptimal) bit-wise detection rule, which was cast as a 
mismatched decoder in tl8J . BICM is used in almost all of the 
current wireless communications standards, e.g., HSPA, IEEE 
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802.1 la/g/n, and the DVB standards (DVB-T2/S2/C2). 

The BICM generaUzed mutual information (BlCM-GMl) is 
an achievable rate for BICM systems H8il and heavily depends 
on the binary labeling of the constellation. The optimality of a 
Gray code (GC) in terms of maximizing the BlCM-GMl was 
conjectured in |fT6l Sec. 111-C] for general GCs, constellations, 
and SNR values. This conjecture was later disproved in |fT9l : 
it was shown that for low and medium SNRs, there exist other 
labelings that give a higher BlCM-GMl (see also |[20i Ch. 3]). 
For further results on BICM at low SNR see ||2T]-|l24l. On the 
other hand, numerical results presented in ll20] Ch. 3] and ll25l 
suggest that GCs are optimal at high SNR. The proof of the 
optimaUty of GCs in terms of BlCM-GMl in the high-SNR 
regime remains as an open problem ll20] Sec. 3.3]. 

In this paper, we derive an asymptotic expression for the 
BlCM-GMl as a function of the constellation, input distribu- 
tion, and binary labeling. Using this expression, we then prove 
the optimaUty of GCs at high SNR. Using the Ml-MMSE 
relationship, an asymptotic expression for the derivative of 
the BlCM-GMl is also developed. The obtained asymptotic 
expressions for the BlCM-GMl and its derivative, as well as 
the one for the bit-error probability (BEP), are all shown to 
be proportional to Q (^y/pd/2y 

This paper is organized as follows. In Sec. [Ill the notation 
convention and system model are presented. The asymptotics 
of the Ml and MMSE are presented in Sec. [Ill] and BICM is 
studied in Sec. HV] The conclusions are drawn in Sec. |V] 

11. Preliminaries 

A. Notation Convention 

Row vectors are denoted by boldface letters x = 
[xi,X2, ■ ■ ■ ,xm] and sets are denoted by calligraphic letters 
C. An exception is the set of real numbers, which is denoted 
by K. The binary set is defined as S = {0, 1} and the bipolar 
set by W = { — The negation of a bit b is denoted 
by b. All the logarithms are natural logarithms and all the 
Mis are therefore given in nats. Marginal probability density 
functions (PDFs) and conditional PDFs are denoted by fy (y) 
and fY\xiy\x)^ respectively. Analogously, PMFs are denoted 
by Px{x) and Px\y i^^lv)- Expectations are denoted by E[-]. 

B. Model 

We consider the discrete-time, real-valued AWGN channel 
in ([T]i, where the transmitted symbols X are constrained to 
X e X ^ {xi,X2,...,xix\}, and where \X\ = M = 2™. The 
set of indices that enumerates all the constellation symbols in 
X is defined as Xx — {1, . . . ,M}. 

We focus on one-dimensional constellations and assume, 
without loss of generality, that the symbols are different and 
ordered, i.e., xi < X2 < ... < xm- Each of the symbols 
is transmitted with probability pi = Px{xi), < pi < 1. 
While the transmitted symbols are fully determined by the 
PMF Px, we shall use constellation to denote the support X 
of the PMF and input distribution to denote the probabilities 
[pi, . . . ,p]^j] associated with the symbols. We assume that 
neither the constellation nor the input distribution depends on 
P- 



The transmitted average symbol energy is finite and given 



by 



M 



Pixf. 
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It follows that the SNR 7 in Q is 7 = pE^. 

An M-ary pulse-amplitude modulation (7\/PAM) constella- 
tion having M equally spaced symbols (separated by 2A) is 
denoted hy £ = {x^ = -(M - 2i + 1)A : i = 1, . . . , A/}. A 
uniform distribution over X is denoted by P^, i.e., pi = 1/M 
Vi. A uniform input distribution with X = £ is denoted by 
P|-", where in this case = 3Es/{M'^ - 1). 

The Gaussian Q-function is defined as 



the entropy of the random variable X as 

Hp, ^ --EllogiPxiX))] 
the Ml between X and Y as 

\pAp)='E[log{fY\xiY\X)/fY{Y))] 
and the MMSE as 

MpAp)^n{x-x^^{Y)r] 



(4) 



(5) 



(6) 



(7) 



where X^^{y) = W,[X\Y = y] is the conditional (posterior) 
mean estimator 

We also define the SEP as 



SpAp)^MX''^'(Y)^X} 
where X is the transmitted symbol and 

l^^P(y)^argmaxPx|y(x|y) 



(8) 



(9) 



is the decision made by a maximum a posteriori probability 
(MAP) symbol demapper 

C. Discrete Constellations 

We define the set of symbol differences corresponding to 

Xi E X as 



^{xi-x:xeX} 



and the set of all symbol differences in X as 

ieix 



-,(■0 



(10) 



(11) 



The MED of the constellation is defined as 

d= min \5\ (12) 

5eVx\{o} 

and the maximum Euclidean distance (ED) of the constellation 

as 



d = max \5\. 
We further define the counting function 



'1, if,5e2?^^ 

0, if (5 ^ 2?^^ 



(13) 



(14) 
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Since is always an element of , we have A^^' (0) = 1 
Vi. 

Analogous to A^J^{S), we define Bp^ (6) as 



y/pj/pi, if 3xj G X : Xi — Xj = S 



0, 



otherwise 



(15) 



Clearly ^^.^^(O) 

We define Ax as twice the number of pairs of constellation 
points at MED, i.e.. 



ieix wew 



(16) 



By using the fact that for any real-valued constellation there 
are at least one and at most M—1 pairs of constellation points 
at MED, we obtain the bound 

2 < Ax <2{M - 1). (17) 

The upper bound is achieved by an A/PAM constellation 

A£=2(Af-l). (18) 

Finally, for a given Px, we define the constants 

and 

" ^ (20) 



For a uniform input distribution Px ~ Px ^nd Bp' ((5) 
A«(<5),so 



A 



X 

M ' 



(21) 



Example 1: Consider a nonequally spaced 4-ary constella- 
tion with xi ^ —5, X2 = — 3, X3 = 3, and X4 = 5, and the 
input distribution pi = i/10 with i = 1,2, 3, 4. In this case, 

2?^'' ={0,-2,-8,-10}, 



{2,0,-6,-8}, 



,(2) 
X 

f ={8,6,0,-2}, 



P^"' ={10,8,2,0} 

and Vx = {0, ±2, ±6, ±8, ±10}. The MED of the constella- 
tion in ([T2]i is d = 2, in (|3]i is £'s = 17, Ax in (O is 
Ax = 4 (two pairs of constellation points at MED), Np^ in 
^ is Np^ = 2^/pIp^ + 2^/pipI w 0.98, and Dp^ in (EOll 

is Dp^ = P2 + Pi + Pi + P3 = I- 

III. HiGH-SNR ASYMPTOTICS 

There exists a fundamental relationship between the MI and 
the MMSE for AWGN channels HI (see also HI Ch. 2]): 



(22) 



Exploiting this MI-MMSE relation, bounds on the MI can be 
used to derive bounds on the MMSE and vice versa. 



Upper and lower bounds on the MI and MMSE for discrete 
constellations at high SNR can be found in e.g., fSl-lTl, 
ll9l- lfT2l . While these bounds describe the correct asymptotic 
behavior, they are, in general, not tight in the sense that the 
ratio between them does not tend to one as p ^ 00. In what 
follows, we present exact asymptotic expressions for the MI 
and MMSE for an arbitrary Px- 

A. Asymptotics of the MI, MMSE, and SEP 

For any given input distribution Px, the MI tends to Hp^ 
as p tends to infinity. In the following we study how fast the 
MI converges towards its maximum Hp^ by analyzing the 
difference Hp^ — lpjf(/o)Q Theorem [T] is the main result of 
this paper and characterizes the high-SNR behavior of Hp,^ — 

Theorem 1: For any Px 



,. Hp„-lp„(p) 

lim ; = ttA'p^ 

P^- Q(^/pc^/2) 



(23) 



where Np^ is given in (fT9] l. 

Proof: The proof is given in Appendix |A] ■ 

Both numerator and denominator on the left-hand side 
(Lh.s.) of (|23lltend to zero as p tends to infinity. It follows from 
L'Hopital's rule that the limiting ratio in (|23] | is equal to the 
limiting ratio of the numerator's and denominator's derivatives. 
By the MI-MMSE relation in (|22]), Theorem [T] thus directly 
gives an asymptotic expression for the MMSE. 

Theorem 2: For any Px 



-ip) 



lim — , 

P^oo Q {^d/2] 



(24) 



where Np^ is given by (fT9] l. 

Proof: The proof follows by applying L'Hopital's rule 
to the l.h.s. of ( |23] | together with ( |22] |. by recognizing the 
derivative of Q (^^d/2) in (|23T l as — G {y/pd/2) d^/8 with 
G(a;) defined in ( |83] |. and by using ( |84| |. ■ 

The asymptotic expression for the SEP is stated in the 
following theorem. 

Theorem 3: For any Px 

lim -IeArL 
P^~ Q (Vpd/2) 

where Dp^ is given in ( l20b . 

Proof: The proof is given in Appendix |B] I 
Theorems [TKH reveal that, at high SNR, the MI, MMSE, 
and SEP behave as 



(25) 



lpx(p)« Hp^ -^iVp^Q 



V 2 



Mp,{p)^-7TNp,Q 2 , 



Sp,{p)^Dp,Q 



V 2 



(26) 



(27) 



(28) 



The results in (I26]l-(l28ll show that for any input distribution, 
the MI, MMSE, and SEP have the same high-SNR behavior, 

'The quantity Hp,^ — Ipx(P) corresponds to the conditional entropy of 
X given Y. 
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i.e., they are all proportional to a Gaussian Q-function0, where 
the proportionality constants depend on the input distribution. 

Remark 1: While the results presented in this section hold 
for any one-dimensional constellation, they directly gener- 
alize to multidimensional constellations that are constructed 
as ordered direct products 1231 eq. (1)] of one-dimensional 
constellations. For example, the results directly generalize to 
rectangular quadrature amplitude modulation constellations. 



10 
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B. Discussion and Examples ^ 

For a uniform input distribution (Px = Px), Theorems [lj{3] ^ 
particularize to the following result. 

Corollary 1: For any X with a uniform input distribution 

logil/- Ipu (p) 



lim 



Q (05d/2) 



M ' 



(29) 



lim 

Q (Vpd/2) 

Spu (p) 
lim , , 
Q (Vpd/2) 



Ax 

-TT 

4 M 



-TT-^, (30) 



M 



(31) 



where Ax given in (fTSI l. 

Proof: Follow directly from (l2ll . ■ 

The expression (1311 1 corresponds to the well-known high- 
SNR approximation for the SEP EH eq. (2.3-29)]. Moreover, 
Corollary[T|shows that for a uniform input distribution, the MI, 
the MMSE, and the SEP for discrete constellations in the high- 
SNR regime are functions of the MED of the constellation and 
by the number of pairs of constellation points at MED only. 

For A/PAM and a uniform input distribution {Px = ^x"), 
Corollary [T] particularizes to (see (ITSl ) 



lim 



logM- Ipcu(p) 
Q (Vpd/2) 



and 



Mpeu(p) 
lim y-^ r 

Q (^d/2) 



~ TT lim 7 

P^-> Q (Vpd/2) 
2(M - 1) 



A/(M + 1) 



(32) 



(33) 



(34) 



10^ 



10 ' 



10 




Exact 

— — Asymptotic 
-O- LB 

-□- UB a 

-V- LB O 
-A- UB fT2l 
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Fig. 1. logM — lp|.u(p) for 4PAM and 16PAM (solid lines) constellations 
(normalized to Es = 1) and the asymptotic expression in (33) (thick dashed 
lines). The lower and upper bounds g] eq. (34)-(35)] and QI] eq. (17)-(19)] 
are also shown. 
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In Table |I] we show a summary of the results obtained in 
this section, including (|32]|-(|34]|. 

Example 2: In Fig. [T] we show the conditional entropy 
log M - I piu (p) for 4PAM and 16PAM with uniform input 
distribution^ together with the asymptotic expression in (|33l l. 
We also show the lower and upper bounds derived in 10 
eq. (34)-(35)] and IH eq. (17)-(19)]. Observe that ^ 
approximates logM — Ipou(p) accurately for a large range 
of SNR. In Fig. |2l analogous results for the MMSE are 
presented, where the bounds derived in ||6] eq. (30)-(31)] and 
lfT2l eq. (13)-(15)] are also included. Again our asymptotic 
expression ( l34l i approximates the MMSE accurately for a large 
range of SNR. 

^Disregarding the "offset" Hp^ in (26). 

^Calculated numerically using Gauss-Hermite quadratures with 300 
quadrature points 1251 Sec. III]. 



Fig. 2. Mpou(p) for 4PAM and 16PAM (solid lines) constellations 
(normalized to = 1) and the asymptotic expression in 134) (thick dashed 
lines). The lower and upper bounds (6] eq. (30)-(31)] and (12| eq. (13)-(15)] 
are also shown. 



Remark 2: It follows from Corollary [T]that the constellation 
that maximizes the MI (or equivalently, the constellation that 
minimizes the MMSE and the SEP) at high SNR is the constel- 
lation that first maximizes the MED and then minimizes Ax- 
For one-dimensional constellations, the MED is maximized by 
an A/PAM constellation {X ^ £). 

We conclude this section by noting that if Theorems [T| and 
I2] are combined, we obtain 

Mp,(p)_^ d2 

4 ■ 



lim 



(35) 



Thus, for any Px, the limiting ratio between the MMSE and 
the conditional entropy is constant, regardless of the input 
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TABLE I 

Summary of asymptotics of MI, MMSE, and SEP. 



Input Distribution 



lim 



Hi 



Q {Vpd/2) 



Mp^.(p) 



lim , , 



lim^ST 

P^°°Q(ypd/2) 



Ax 
M 



_2(M - 



M 



T^^^- T^ir ^M(M + 1) 



2(Af - 



71/ 



distribution. Moreover, using Theorems [T] and |3] we obtain 



lim 



Spx ip) Dp^ 
which for a uniform input distribution particularizes to 
logM- Ipu(p) _ 



lim 

p—¥00 



(36) 



(37) 



Thus, for any Px, the limiting ratio between the conditional 
entropy and the SEP is constant, which for a uniform input 
distribution equals tt. 

IV. Application: Binary Labelings for 
Bit-Interleaved Coded Modulation 

We next study the high-SNR behavior of BICM systems 
ifTSI - lflTl . In such systems (see Fig.[3]l the bits are mapped to 
constellation symbols using a mapper $ : — > X. Using the 
results in Sec. |III1 we will find an asymptotic expression for 
the BICM-GMI and we will study the relationship between the 
BICM-GMI and the BEP. We will also prove the asymptotic 
optimality of GCs in terms of BICM-GMI for one-dimensional 
constellations with uniform input distributions. 

A. BICM Model 

A binary labeling for a constellation is defined by the 
vector I = [hjh, ■ ■ ■ Jm] where li G {0, 1, ... , M — 1} is 
the integer representation of the ith length-m binary label 
9i = ■ ■ ■ , Qi.m] G 'B™ associated with the symbol .t,;, 
with 1 being the most significant bit. The labeling defines 
2m subconstellations X^^b C X for fc = 1, . . . , m and b € B, 
i.e., Xk^b ^ {x^ G X : 'q,,k = b} with \Xk,b\ = M/2. We 
define Xx,. ^ C {!,..., M} as the indices of the symbols in 
Xk,b- 

In BICM, the coded bits Q = [Qi,Q2, ■ ■ ■ , Qm] at the input 
of the mapper (see Fig. |3]l are assumed to be independent but 
possibly nonuniformly distributed. Therefore, the vector of bit 
probabilities [Pq^ (0), Pq^ (0), . . . , Pq^ (0)] induces a symbol 
input distribution Px via the labeling as 1231 eq. (31)] 1271 
eq. (8)] 



k=l 



Using 



we obtain the conditional probabilities 

if X e Xk,b 



Px\QMb) 



Pxjx) 



.0, 



if a; ^ Xk 



(38) 



(39) 



for k ~ l,...,m and b £ B. According to 
(i39l l. each of the 2m conditional input distributions 
[Px\Q^{xi\b), . . . ,Px\QAxM\b)] has AI/2 non-zero proba- 
biUties, which specify which of the M/2 symbols in X are 
included in Xk^b- 

For uniformly distributed bits, i.e., Pq^ (6) = 1/2, it follows 
that the symbol distribution is also uniform, i.e., Px = Px, 
and thus. 



Px\QA^i\b) 



fj, \f Xi& Xk.b 

0, if Xi i Xk.b 



(40) 



We shall use Xkfi to denote a random variable with support 
Xk.b and probabilities Px\QAx\b) for x G X^^b in (i39l l. The 
corresponding PMF is denoted by Px^ b and the PMF for the 
uniform case in ( |40l i is denoted by Pj^^ ^. 

In what follows, we will apply the results of Sec. HII] to 
BICM. To this end, we will often replace X and Px in Sec. Hill 
by Xk^b and Px^ ^, respectively. Note, however, that d and d, 
as defined respectively in SlTi and (ITJl l. still denote the MED 
and maximum ED of the constellation X. We will not consider 
the MED or maximum ED for subconstellations. This implies 
that it is possible that no pairs of constellation points in X^^b 
are at MED. Consequently, the bounds in (ITTl i become 



0<^;t,, <2(A//2-l) 



(41) 



Moreover, when subconstellations are considered, Np-^ in 
iT% becomes 



[wd] 



(42) 



and for a uniform input distribution. 



Np^ = 



M/2 



(43) 



Example 3: In Fig. [H we show the 2™ ~ 6 subconstella- 
tions for an SPAM constellation labeled by the binary-reflected 
Gray code (BRGC) I = [0, 1, 3, 2, 6, 7, 5, 4] lH-lll, as weU 
as the corresponding values of Ix^ b and Ax^, ^ ■ 



B. Binary Labelings for BICM 

The natural binary code (NBC) l23l Sec. II-B] is defined 
as the binary labeling I where U = i — for i = 1, 2, . . . , M. 
The NBC is an important labeling for BICM because it is 
the unique optimal labeling for BICM in the low-SNR regime 
for X ~ £ and uniformly distributed bits ll23l Theorem 14]. 
These results were extended to arbitrary distributions in [l24l . 
A labeling I is said to be a GC if for all z,j such that \xi — 
Xj I = d, the binary labels and Qj are at Hamming distance 
one. One of the most popular GCs is the BRGC l28l-ll30l. 
which we showed in Example [3] for AI = 8. 

To characterize binary labelings we define 



^-^' = EE E E4!,(w) 



(44) 



k,b 



k=i beB ieix. . wew 
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BICM Encoder 



Z 



BICM Decoder 



ENC 


► 


n 












Y 




$-1 


— ► 


n-i 


— ► 


DEC 




*■ 







Fig. 3. A BICM scheme: The BICM encoder is fornied by a serial concatenation of a binary encoder (ENC), a bit-level interleaver (11), and a memoryless 
mapper ($). The BICM decoder is based on a demapper ('J>~^) that computes logarithmic likeHhood ratios, a de-interleaver (H"^), and a channel decoder 
(DEC). 
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Fig. 4. Subconstellations X^^t for SPAM labeled by the BRGC I = [0, 1, 3, 2, 6, 7, 5, 4], Ax = Cx,i = 14. The values of Ax^ ^ and Xx^ ^ are also 
shown. 



and 



k=ibeBieix^ ^ wew 
where for a uniform input distribution {pi = 1/M) 



Ep"i — 



mM' 



(45) 



(46) 



For a given subconstellation Xk,b, the two inner sums in 
(|44] | consider all the constellation points in the subconstellation 
Xj^ 5 at MED from Xi G X^^b- Thus, the quantity Cx,i 
corresponds to twice the total number of different bits between 
the labels of constellation symbol pairs at MED. Using this 
interpretation, it follows that (l44l can also be expressed as 



obtain that for any X and I 

Cxj > Ax. (48) 
For X = £ and the NBC, Cx.i can be expressed as 



k=l 



2{2M - m - 2) 



(49) 



which is obtained by noting that, for each k, there are 2^^ — 1 
symbols satisfying qi^k qt+i,k, for i 1, 2, . . . , i\/ - 1. 



C. Asymptotic Characterization of BICM 



Cx.i — 



k=l 



{Ax - Ax^ a - Ax^ i] 



(47) 



where Ax — Ax^g — Ax^i corresponds to twice the number of 
pairs of constellation points at MED with different labeling at 
bit position k. For example, for the constellation and labeling 
inEig.g] Cx^ = U = Ax. 

While Ax in ( fTSI l depends only on the geometry of the 
constellation, Cx.i in (l44l i depends on both the geometry of 
the constellation and the labeling. By noting that any pair of 
constellation points at MED will differ in at least one bit, we 



The BICM-GMI is an achievable rate for BICM (181 and is 
one of the key information-theoretic quantities used to analyze 
BICM systems0 For any Px and I, the BICM-GMI is defined 
as El eq. (24)] 

m 

~\p,,i{p) ^ mlp, (p) - E E (^)I^-.,. (50) 



'^Note that even though the BICM-GMI is fully determined by the bit 
probabilities [Pq-i{0), Pq^iO), . . . , Pq^^{0)], we express it as a function 



of the input distribution Px in (38). 
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and twice it derivative a^ 
Mpx,i(p) = 2 — 



(51) 



= mMp, (P) - E E WMi'x,,, (P)- (52) 

k=l beB 

For the particular case Px = Px we obtain 



Tp„ A (p) - - E E l^i. . (z^)' (53) 

^ m 

ilp.,Kp) ^ mMp. (p) - 2 E E Mpi, , (P)- (54) 



fc=i 6e6 



Like the Ml, the BICM-GMI also tends to H as p tends 
to infinity. The following theorem shows how fast \px,iip) 
converges to Hp^ . 

Theorem 4: For any Px and I 



Hpx -\pxAp) 



lim 

P^^ Q (Vpc?/2) 



fc=l ^ bsB ^ 



(55) 

where iVp^, and A'^p^ are given in (dill and dllll, respec- 
tively. 

Proof: The proof is given in Appendix |C] ■ 
The following theorem gives an asymptotic expression for 

Mpx.i(p)- 

Theorem 5: For any Px and I 



lim 



P^ooQ(^d/2) 4 

(56) 

where Np^ and -/Vp^^ ^ are given in iT% and (|42] |. respec- 
tively. 

Proof: By using ( |52] ) and Theorem |2] ■ 
In analogy to ((HI)-®, we define the BEF0 as 

BpxAp) = - E Pr{^r'(>') ^ Qfc} (57) 



fe=l 



where Qfc is the transmitted bit and 



)r'(y)^argmaxPQ,|,.(6|y) 

beB 



(58) 



is the decision made by a MAP bit demapper. The next 
theorem gives an asymptotic expression for the BEP in (l57t- 



(59) 



Theorem 6: For any Px and I 



p— J-CXD 



Q 



where Ep^ i is given in 

Proof: The proof is given in Appendix |D] ■ 

'since the BICM-GMI is not an MI, its deiivative is not an MMSE [31]. 
We thus avoid using the name MMSE, although we do use an MMSE-Uke 
notation Mp^ i{p). 

^Note that (37) is the BEP averaged over the m bit positions, in contrast 
to the BICM-GMI in (Soji, that is a sum of m bit-wise Mis. 



Similarly to (|26]l-(|28Tl. we can use Theorems to show 
that, at high SNR, the BICM-GMI, twice its derivative, and 
the BEP behave as in (|6Q]l-(l62]i (see next page). 

The results in (|60]|-(|62]| show that in the high-SNR regime, 
the BICM-GMI, its derivative, and the BEP have the same 
asymptotic behavioiQ, i.e., they are all proportional to a 
Gaussian Q-function that depends only on the MED of the 
constellation. 

For a uniform input distribution. Theorems ^^particularize 
to the following result. 

Corollary 2: For any X and I and a uniform input distri- 
bution 



logil/-lpu,j(p) Cx.i 

lim ^ = TT , 

P^o° Q(7pd/2) M 

lim — r = — tt 

p^o°Q(v^d/2) 4 M 

lim ^-■^ 



(63) 



(64) 



(65) 



p^oo Q mM 

where Cx.i is given in (l44l) . 

Proof: The expression in ( l63T l follows from Theorem 2] 
combined with (EJ, dlS, (H?]), and Pq,,(5) = 1/2, the one 
in (ImI from Theorem |5] combined with dlB, (BD, and 
PQk (^) = l/2> ™d the one in i65[ from Theorem |6] and (l46l l. 

■ 

The results in Corollary |2]indicate that, for a uniform input 
distribution, a maximization of the BICM-GMI is equivalent 
to a minimization of both its derivative and the BEP. The 
asymptotic results for BICM are summarized in Table 

D. Optimality of Gray Codes 

To study the asymptotic behavior of the BICM-GMI for 
different labelings I, we introduce the two functions 



IPx - IP:x 

A MPx,i(P) 



(66) 
(67) 



Mp.(p) 

Noting thatTp^,j(p) < Ip^(p) lUSl eq. (16)], El Theorem 5], 
we have 

KL,i(p) > 1- (68) 



We further define 



Rpx,; = lim Kp 

p— >-oo 

= lim K5^^,,(p) 



(69) 
(70) 



where (fTOl i follows from L'Hopital's rule. Theorems [T] and 2] 
yield 

T.Zi{Npx-i:beBPQdb)Np,J 



R 



Px.l = 



and because of 



Npx 

Rpx,i > 1- 

^Disregarding the "offset" Hp^ in (60). 



(71) 
(72) 
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TABLE II 

Summary of asymptotics of the BICM-GMI, twice its derivative, and the BEP. 



Input Distribution 




Px 


pu 


^.^Hp -lp,,(p) 
P^oo Q (ypd/2) 


m y 

fc=i ^ 




Cxi 
" M 


lim ^/-'(^^^ 


fe=l ^ 






lim '^--'^^^ 






mM 



In the rest of this section, we study Rpx,j in dTTI ) for 
a uniform input distribution P\. With a slight abuse of 
notation, we will refer to Rp^.i as Rx,i- We shall say that, 
for a constellation X and a uniform input distribution, a 
labeling i is asymptotically optimal (AO) in terms of BICM- 
GMI if it satisfies Kx.i = 1- Intuitively, this can be under- 
stood as follows. The condition R;^,; = 1 guarantees that 
limp^oo Kp^ j(p) = 1, and because of ( |66] ), an AO labeling is 
then a binary labeling for which the BICM-GMI approaches 
as fast as the MI does for the same constellation X. 

Corollary 3: For any labeling I and constellation X, Rx.i 
can be expressed as 

Cx,i 



R 



x.i = 



A 



(73) 



X 



Proof: Follows by combining (|66] l and i69i with ( |29] l and 
(HHi. ■ 

Example 4: For an A/PAM constellation, R^.j can be ex- 
pressed using ( fTSI l and ( |73] | as 



R 



(74) 



2(A/- 1) 

If the labeling is the NBC, we further use ( |49] l to obtain 

_ 2M-m~2 

^£,Inbc - ]\J _l 

which fulfills Rf.iNBc > 1 for > 1. We thus conclude that 
the NBC for AfPAM is not an AO labeling. 

The following theorem demonstrates that GCs are AO at 
high SNR, and thus, it proves the conjecture of the optimality 
of GCs in terms of BICM-GMI [l6l Sec. III-C]. 

Theorem 7: For any constellation X and a uniform input 
distribution, a labeling is AO if and only if it is a GC. 

Proof: For any GC, all the pairs of constellation points 
at MED are at Hamming distance one. Thus, ( |48] l holds with 
equality, and therefore, Rx.i = 1- This completes the "if" 



part of the proof. The "only if" part follows from that for 
any non-GC, there is at least one pair of constellation points 
at Hamming distance larger than one, thus, Cx,i > Ax, and 
therefore, Rx,i > 1- ■ 

Remark 3: The results about the optimality of GCs directly 
extends to multidimensional constellations that are constructed 
as direct products of one-dimensional constellations, provided 
that the labeling is generated via an ordered direct product 
of GCs. This construction of constellation and labelings was 
formally used in e.g., [23, Theorem 15]. 

Remark 4: The NBC was shown in Example|4]not to be AO 
for an AfPAM constellation. However, if the NBC is used with 
a nonequally spaced constellation, it may become AO. This is 
the case for example if the NBC is used with the constellation 
in Example [T] in which case the NBC is a GC. 

We next derive an upper bound on Rx.i- 

Theorem 8: For any one-dimensional constellation and any 
labehng I 



and thus. 



Cx,i < min {niAx, (m - l)Ax + M) (76) 



min {mAx , (m ~ l)Ax + M) 
^x.i < z ■ (//) 

Ax 

Proof: We note that for any labeling there are exactly 
AI/2 pairs of labels at Hamming distance m. Because of this, 
at most AI/2 pairs of constellation points at MED can each 
differ in exactly m bits, which can be the case only if Ax < 
M. This case gives Cx,i < mAx- If there are more than Af/2 
pairs of constellation points at MED, i.e.. Ax > M, Af/2 
pairs can differ in m bits and the remaining {Ax — M)/2 
pairs can differ in at most m — 1 bits, which gives Cx,i < 
mM + {m—l){Ax^ M) = {m—l)Ax + M.T\ie expression 
in dTTli follows from (|76ll and dlSi. ■ 



k=l 



k=l 



beB 



Bp,,,(p)«f?p,.Q 



(60) 

(61) 
(62) 
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For an AfPAM constellation, using dTsl l. Theorem [8] yields 

Af-2 



R 



£,l 



< m 



2M -2' 



(78) 



and K^. 



Example 5: In Fig. |5] we show the functions K'pou;(/o) 
i{p) in (l66T l and (|67] |. respectively, for a 4PAM 
constellation with a uniform input distribution {Px = f*™' 
A^t" = 6) and the three labelings that give different BICM- 
GMI; h = [0, 1, 3, 2], h = [0, 1, 2, 3] and h = [0, 3, 2, 1]. The 
values of Rf.i in (|74|l are also shown. Unlike the BICM-GMI 
curves usually plotted in previous works (see |fT9] Fig. 3] and 
EU Fig. 1]), the functions K^o„ and K% i{p) allow us 
to study different labelings at high SNR. Moreover, Kpou 
also allows us to study different labelings at low SNR: ^ig. \5\ 
shows that the NBC (i.e., I2), is the binary labeHng for 4PAM 
that gives the largest value for Mpu as p tends to zero, 
cf. 1221, 1231 Theorem 14]0 Observe that the GC (i.e., h), 
as expected, gives Rgj^ = 1, and that ^3 achieves the upper 
bound in CM, i.e., Rsj^ = 5/3. 

Example 6: In Fig. |6] we show the function Kpcu i{p) in 
dSZli for SPAM (Px P™, Ax = 14) and alf the 458 
labelings that give a different BICM-GMI l25l. In this figure, 
12 possible values of ^ in ( |74l ) are clearly visible, which 
coincide with the results in ||25] Fig. 3]0 Using ( |65] l, the 12 
values of R^ ^ in Fig.|6]also translate into 12 different asymp- 
totic BEP curves, which were recently reported in [l32] Fig. 4]. 
The value R^.j^^c obtained using (fTSl l is also shown. A careful 
examination of Fig. |6]reveals that there are three labelings min- 
imizing Rf These are the three nonequivalent GCs (in terms 
of BEP) 1^ Table I]: the BRGC I = [0,1,3,2,6,7,5,4], 



10 



^ 10 



^ 10 



10 



NBC 



GCs 



0? 



O0?9?oo 



?9 



?Q Upper Bound 



1.5 



2.5 



3.5 



Fig. 7. Approximated PrjRf ; = r} using lO'' randomly generated 
labelings for 16PAM (normalized to Eg = 1). For GCs Rg^i^^ = 1 and 
for the NBC Rf.iNBC ~ 26/15. The upper bound in l l78t is also shown. 



I = [0,1,3,2,6,4,5,7], and I = [0,1,3,7,5,4,6,2]. 

Example 7: Motivated by l23l Fig. 6], we present in Fig. |7] 
an approximation for the PMF PrjR^t'.i = r} for 16PAM 
obtained by randomly generating 10^ labelings. This figure 
shows that most of the possible labelings are not Gray. For 
M = 16, we obtain Rs.i^^c = 26/15, cf. which is 

highlighted in Fig. |2l The upper bound in ( |78] l is also shown. 
In the next section, we will show how to construct a labeling 
that achieves this upper bound. 



'^The relationship between the coefficient a determining the low-SNR 
regime for a zero-mean constellation with a uniform input distribution |23l 
eq. (47)] is a log 2 = lirrip^o ^pe^ lip) (see also (13] eq. (86)]). 

^T<t^rtUar n^^tcl tli.it lim ^ U''^ 



Further note that limp_>o j (p) reveals the 72 classes of labelings 
reported in 1231 Fig. 6 (a)]. 



E. Anti-Gray Codes 

In the previous section, we have seen that GCs minimize 
R^t'.i- In this section, we show that, for A/PAM constellations. 
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it is always possible to construct a labeling that maximizes 
Rs^i, i.e., a labeling that achieves the upper bound in ( |78] i. 

We define the set of all possible values that Cx,i can take 
as Cx, where 

\Cx\<^ mill {(77Z - I) Ax + 2, {m - 2) Ax +M + 2} 

(79) 

which follows from the fact that Cx.i is an even integer 
bounded by (|48]l and ( |76] l. 

The expression in (|79] l is an upper bound on the number of 
classes of labelings with different high-SNR behavior in terms 
of BICM-GMI or BEP. For the particular case of X ^ £, by 
using ( fTSl l in (|79] l, we obtain 

3 A// 

\C£\<mM - — -m + 3. (80) 

For 4PAM |Cf | < 3 and for SPAM \C£ \ < 12, which coincides 
with the 3 and 12 classes for high SNR shown in Fig. |5] 
and Fig. |6l respectively. For 16PAM, the upper bound ( |80] l 
indicates that \Cs \ < 39, however, the results in Fig. Q show 
only 37 classes. This raises the question of the tightness of 
the bound in dSOl l (or equivalently, the upper bound in (fTsTl), 
which we address in what follows. 

The anti-Gray code (AGC) of order m > 2 is defined by the 
M X m binary matrix W„j, where the ith row is the binary 
label for Xi, where Wi = [0, 1]^, and where the following 
steps construct W„i from W,„_i: 

Step 1 Reverse the M/2 rows in W„j_i, and append them 

under W,„_i to construct a new matrix W'^, with M 

rows and m — 1 columns. 
Step 2 Append the length M column vector 

[0,1,0,1,...,0, 1]"^ to the left of W;, to create 

W^, with M rows and rn columns. 
Step 3 Negate all bits in the lower half of W"j to obtain Wm. 

The recursive construction described above is illustrated in 
Fig. m for m = 2 and m = 3. The following lemma shows 
that this construction indeed leads to a valid labeling. 

Lemma 4: All the rows in W,„ are unique, and thus, the 
AGC is a valid labeling. 

Proof: Consider the above construction of an AGC. Wi is 
a valid labeling in the sense that all rows are unique. Because 
of Step 1 every odd row in the upper half of W'^ is identical to 
an even row in the lower half of WJ„, which directly implies 
that all rows of W^J^ in Step 2 are unique. Thus, is a 
valid labeling. Moreover, every odd row in the lower half of 
WJ^ differs in m bits compared to the row below. Inverting all 
the bits in these M/2 rows is therefore equivalent to swapping 
every odd row in the lower half of WJJ, with the row below, 
which makes W,„ a valid labeling with M unique rows. ■ 

The next theorem proves that, at high SNR, the AGC is the 
worst binary labeling for 7\/PAM constellations. 

Theorem 9: For X = £, the AGC achieves the upper bound 
in (|78ll, i.e., 

M -2 

f<SMoc=m~^^^. (81) 

Proof: Let iJ,„ — Csjaoc denote twice the sum of the 
Hamming distances between all adjacent rows in W^, and 



let i/'^j and H," denote the same quantity for and W^, 
respectively. Steps 1 and 2 give = 2H,„_i and iJ," = 
H'^ + 2{M-1). It then follows that H„, = H';^-2 + 2{m-l), 
since row Af /2 and row il//2 + l in W"j differ in only one bit 
and therefore the same rows in differ in m — 1 bits. This 
gives Hm = 2iJ„i_i + 2(M + m — 3), which combined with 
Hi =2 can be shown to give Hm = 2 (mM — m — ^ + l). 
Together with ( |74l l, this completes the proof. ■ 

The labeling ^3 in Example |5] and Fig. |5] (i.e., W2 in 
Fig. [g is the AGC for 4PAM with Rs^i =5/3 given by (gB- 
For SPAM, the AGC is Ugc = [0,7^2,5,6,1,4,3] (W3 in 
Fig.lUl, whose corresponding function Kpou i^^^^ip) is shown 
in Fig. |6l with R^,, = lS/7. 

For M = 16, the labeling that maximizes Rs i (Rg.i — 
106/30 w 3.53) is the AGC W4 (as shown by Theorem |9ll, 
which can be constructed as described before. It can be further 
shown that the labeling with the second largest ( {Rs.i ~ 
104/30 « 3.47) can be constructed by reversing the order of 
the three first rows of the AGC W4. This demonstrates that 
for 16PAM all 39 classes are indeed possible. The last two 
classes are not shown in Fig. [T] because the total number of 
labelings in this case is 16! w 2.1 • 10^"^ (without discarding 
trivial operations), so randomly generating 10^ labelings only 
covers a small fraction of all possible labelings. 



V. Conclusions 



In this paper, we studied discrete constellations with arbi- 
trary input distributions over the scalar AWGN channel in the 
high-SNR regime and derived exact asymptotic expressions 
for key quantities in information theory, estimation theory, and 
communication theory. The six quantities we studied are the 
MI, MMSE, SEP, the BICM-GMI, its derivative, and BEP. 
Our results show that, as SNR tends to infinity and disregard- 
ing offsets, all these quantities are proportional to Q {^^d/2^, 
where d is the MED of the constellation. These results show 
asymptotic equivalences between all these quantities as well 
as the importance of the Gaussian Q-function. 

For a uniform input distribution, the proportionality con- 
stants for the MI, SEP, and MMSE were found to be a function 
of the MED of the constellation and the number of pairs of 
constellation points at MED only, and thus, the constellation 
that maximizes the MI in the high-SNR regime is the same 
that minimizes both the SEP and the MMSE. 

We then applied our results to the problem of binary 
labelings for BICM. By characterizing the high-SNR behavior 
of the BICM-GMI, asymptotically optimal binary labelings 
were found, and the long-standing conjecture that Gray codes 
are optimal at high SNR was proved. We also proved that there 
always exists an anti-Gray code for A/PAM constellations, 
which is the labeling that has the lowest BICM-GMI at high 
SNR. 
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Fig. 8. Proposed recursive constraction of an AGC for m = 2 and m - 



Appendix A of (ISTT i. 
Proof of Theorem[T] 

We start by upper and lower bounding the Q-function via ^hat follows, we calculate the hmit on the rh.s. of 

Prop. 19.4.2] Using ^ and and substituting r = d^/p/8, we obtain 

^- ^)<^{x) <Q{x) <Gix), x>0 (82) lim , ^ 2 f lim FrM + lini F+(r)) (89) 



where 



where 

G(a;)^-^e-^. (83) , 



It follows that 

G(x) 



lim ^ = 1. (84) -log I i^W) e-^-*-^^^^ | dt (90) 



x—^oo 



2 



The numerator of the l.h.s. of (123b can be expressed as 
where 



and 

/>oo 



n 

/ 

(i) 



log I Y: «W)'e-^"^-^'^^^ I dt. (91) 



•log J2 (^PxW) e-v^*^-^ I dt. (86) 

The expressions in (l85T l and (|86] | are obtained from ||25] eq. (5)] 
and the definition of entropy. p- _ / 

Combining (l84| and yields * J- 



We begin with the first limit on the rh.s. of (1891 1. Using the 
substitution t = u/r — r, we express F~ (r) in (|90l ) as 



Y V. lim TTT^^. (88) ^ 

G [y/pd/2) where 



As will become apparent later, the limit on the right-hand side {7(ij)A^j^_j^j ^g^-j 

(r.h.s.) of dSST l exists and, hence, so does the limit on the l.h.s. d \d J 
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and U{5) > OyS £ Vx- Defining upper-bound ( l94l i as 

/; [r, u) < h{r^ - u)e^" log 1 + 



log 1+5: «W) c-^"^-^^^^(*M (94) 



<5eX'* 



(104) 

witii V* = V'§ \ {0} and h{x) being the Heaviside's step < e^" log 1 + ~ ~ ^^^^^ 

function (i.e., h{x) 1 if x > and h{x) if x < 0), \ ^' / 



Fi (r) in d92] i can be written as , , , , . 2,,,^-, 

' ^ ^ where to pass from (fT04t to (fT05^ we used e""'- '^('5) < 



^"W = / /r('^.«)d«. (95) 



g-4«c/(5) fQj. < ^2 (because C/((5) > 0) and that the rh.s. of 
( IIO5I 1 is nonnegative for u < . 



Note that (r, u) is a nonnegative function, and because m > 0, we have 

lim /r (r, 7.) = c^" log (l + fi?*.*' (d)) ' q-^" ) , « e 



/-(r,?.)<c^"log(^l + ^— e-""j (106) 

which is obtained by replacing 5^ in dlOSb by d^. For u < 0, 
(96) (1105b is upper-bounded by 
We will further show that, for every r > 0, /~ (r, u) is f M - \ 

uniformly bounded by some integrable function [u] that fr [r, u) <c2-log — c-4«'^/'^- (107) 

is independent of r (see Lemma |5] ahead). To compute the V Pi / 

first limit on the rh.s. of (|89), we can thus use Lebesgue's ^^^^^ ^^^^jj^g^ ^^-^^ ^ ^ ^/p^ ^ ^-iud^/d- by 
Dominated Convergence Theorem El Theorem 1.34] to ob- replacing 6^ in by d\ The proof of W follows from 
tain 

_ gr (^) = 5r (") + gr («) dw dos) 

° 1 M 

g-(ii)dii= -log— + — (109) 



r— i-oo / J —QO 



00 



where 

lim /,-(r,w)dw (98) 



r c2« log (l + (sf^ {d)) ' c-*") Au 
^-00 V ^ ^ / and 



(99) ^ ^ 



(100) ]^ g-(^)d^<y ^e-log(^l + — — e— j d« 



where ( |99] l is obtained from ( 196) . and (1 100b follows from using tt AI — 1 

2« fnffpthf>r with iRSl fn (d 979 111 =771/ — — (HO) 



the substitution x = e^" together with 1351 eq. (4.222.1)]. ~ 2 



which follows in analogy to (|99l)-( ll00b . 

The second limit on the rh.s. of ( |89] l c; 
the same lines by substituting t ~ u/r + r in ( |9TT i. which gives 



It thus remains to show that / (r, u) is uniformly bounded 
, ■ 11 r - / \ . • • J J . r The second limit on the rh.s. of (^89b can be computed along 

by some integrable function q- iu) that is independent of r. ' _, ^ 

We do this in the following lemma. 

Lemma 5: For any r > we have 

0<frir,u)<giiu), ueR (101) 



lim F+{r) = ^B^l{-d). (Ill) 
Combining ( fTOOb and (fTTTl i with dSSll and dM) yields 



where 5,^ (m) IS defined as p^oo Q ^ V ^^'^^ ^^'^ ^ 



c 



2ii 1 



W f Me"4urfV<i^^ ^<0 (112) 



Pi 

and satisfies g~ (u) > 0, u G M. and 



9i (^) c^"lo fl + J^^-ie""*"^ > (102) which, by ( fT9] l, is equal to uNp^. This proves Theorem[T] 

Appendix B 
Proof of Theorem[3] 

Using Bayes' rule, X^^^{y) in (|9]l is expressed as 



g~(w)d'u<oo (103) 
and where (i is given by (fTSl t. 



= argmax{/y|;f(y|a;)Px(.T)} (113) 
Proo/.- We use c'T^ < 1 and (sf {5)f < l/p, to = x,, if y e y^^^ip), ^ = 1, . . . , M (114) 
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where ^^(p) is the Voronoi region for the symbol Xi defined Replacing Sp^ (p) by ( |125l l in the l.h.s. of ( |25] l. we obtain 

X'^-'IP) ^ e R : >i,iel;,\ {i}} Q^y^ 

I P]fY\x[y\xj) ■ J (,) 



(115) T..,..P^T.sev.A-'mWpSV2) ^^^^^ 

= {ye^:Pf_^^{p)<y<PT\p)} (116) ""^"^ Q(Vprf/2) 

and /3rAP(^), for * = 1, . . . , M - 1, are the thresholds of the = lim Q(V^^/2) E.^x. K E^gw (^'^) ^^27) 

Voronoi regions yf'^^ip), P^^lip) ^ -oo and /Sfj^^ip) ^ "^"^ Q(%/pd/2) 

+00. Using dl 15t . the thresholdo /3f ^^(p) in dl 16l l are found where to pass from il26\ to (1127b we consider only the Q- 

by solving functions with the smallest argument. Combining (I127I I with 

f ^flMAP. ^ . (RMAPf ^, X (|20li Completes the proof. 

PijY\x[Pi [Pj\Xi) =Pi+lfY\x[Pi [P)\Xi+l) (117) 

Appendix C 

^hich gives PROOF OF THEOREMS] 

^MAP. N log(Pi+i/P») ^ ^{x^ + x,+i) ^^^^^ Using the expression for the BICM-GMI (|50ll, we have 

y/pix^-x,+i) 2 ■ u T / N 

lim ^^-r'^vt) 

On the other hand, the maximum likelihood (ML) symbol p-+oo Q 

demapper is defined as / Hp — Ip (p) 

— lim { m ^ 



^^■^ " Hp -Ip fp) 

= argmin{(y-V^a;)^}. (119) -EE^Q^(^)- ^ 



k=l beB 



Q (Vpd/2) 



From dm it is clear that (m - l)Hp,, - Er=i Efcee ^Q. (^)Hp. 



(128) 



2 

Combining jllSIl and (11201) . we obtain 
and thus, to prove Theorem |3] it is enough to study the SEP = " E E E ^Q" (^^^^IQ^ log Px\q, {x^\b) 



The third term on the rh.s. of ([128) is zero because 

m 



lim ml/V = 1 (121) fc=ibeB 



(129) 



for the ML symbol demapper. 

Using (1114b and (1116b . the SEP in ((Sj is expressed as 

SP.(P)= E^^'Pr{^^^r('^)l^ = -»} (122) =-E E^^'logT^T^ (130) 



E P. Q - vp^o '"Hp^, + E E log^Q^ (*.^) (131) 

(123) =mHp, + E?'»logn^Q'=(*,fc) (132) 

i£Xx k=l 



Y^P,(q( ~ ) = mHp, - Hp, (133) 

where to pass from (1129b to (1130b we used (l39T l. and to pass 



^ Q / \ (124) from (fT32] i to il33[ we used 

V 2 // To compute the first two terms on the rh.s. of (I128t . we 

f y/p6^\ change the order of summation and limit and apply Theorem [T] 

\S)Q{^-\ (125) 

to each term. This proves Theorem |4] 
Appendix D 

where Vi is defined as the set of distances between Xi and its PROOF OF Theorem[6] 

(one or two) closest symbols in X. BEP ^ expressed as 



'"Throughout this proof we assume, without loss of generality, that p is 
large enough so that Xi < P^^^{p) < Xi^i. 



= - E E p^P'i^ ^ yTip)\^ = ^0 

(134) 



m 

k=l ielx 
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where J^f/f (p) is the "bit-wise Voronoi" region for ^ [12 



{y e M : PQ,iY{q:.k\y) > 



given by dSSl ), i.e.. 

PQ^|Y(9i,fe|j/)}. In analogy with the proof of Theorem |3] 
the asymptotic BEP is obtained by considering the ML bit 
demapper instead and only symbols Xj at MED from Xi, i.e., 

lim^^ 



lim 

p— 



Q (Vpd/2) 



(135) 



lim 



Q 

\xi-Xj\^d 



EE 



E 

fe=l b£Bi€Xx,, 



Pi 



Q (Vpd/2) 

E 1 

|a.;i— a:j \—d 



(136) 
(137) 



where to pass from (I136t to (1137b we expressed the sum over 
i E Ix as a sum over b G B and over i G Ix^ ^ ■ The expression 
in ( |59] l is obtained by recognizing the innermost sum in (1137) 
as _{-d) + A^l^ _{+d) and by using (gUl. 
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